Creating an Annotated Set of Medical Reports to Evaluate Information Retrieval Techniques
Abstract
Information retrieval algorithms are often evaluated by measuring their output against a gold standard. Because constructing such a gold standard requires considerable resources, it should be highly valid and as reusable as possible. From a scientific point of view, different gold standards should also be highly comparable, so that the results of different measurements can be compared. Unfortunately, the validity of such a standard can be negatively affected by many different factors, and its reusability and comparability are often limited. In this article, we enumerate some problems that can negatively affect the result. Using the creation of an annotated set of pathology reports as an example, we show in which form these problems can emerge and how we have tried to minimize their influence.
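To make the idea of measuring against a gold standard concrete, the following is a minimal sketch of how retrieval output is commonly scored against gold-standard relevance judgments using precision, recall, and F1. It is an illustration only, not the procedure used in the article; the function name `precision_recall_f1` and the report identifiers are hypothetical.

```python
def precision_recall_f1(retrieved, gold):
    """Score a retrieval result against gold-standard relevance judgments.

    retrieved: iterable of document IDs returned by the system (hypothetical IDs)
    gold:      iterable of document IDs the annotators marked as relevant
    """
    retrieved = set(retrieved)
    gold = set(gold)
    true_positives = len(retrieved & gold)

    # Guard against empty sets so the ratios are always defined.
    precision = true_positives / len(retrieved) if retrieved else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    if precision + recall == 0:
        f1 = 0.0
    else:
        f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


if __name__ == "__main__":
    # Hypothetical gold standard and system output for one query.
    gold_relevant = {"report_03", "report_07", "report_12", "report_21"}
    system_output = ["report_03", "report_07", "report_09", "report_15"]

    p, r, f = precision_recall_f1(system_output, gold_relevant)
    print(f"precision={p:.2f} recall={r:.2f} f1={f:.2f}")
```

Any annotation error in the gold set changes these numbers directly, which is why the validity of the standard matters for every measurement made against it.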
Similar resources
Performance Evaluation of Medical Image Retrieval Systems Based on a Systematic Review of the Current Literature
Background and Aim: Images, as information vehicles that can convey a large volume of information, are important, especially in the medical field. The existence of different image feature attributes and various search algorithms in medical image retrieval systems, and the lack of an authority to evaluate the quality of retrieval systems, make a systematic review in medical image retrieval system...
The Feasibility Study of Launching Book Recommendation System on the Basis of a Lending and Selling System of e-Books and Digital Taktab
Background: The study was conducted to achieve three axes of goals (users, publishers and the system) by way of objectives related to: A) Users - measuring the level of their satisfaction with the Taktab system and also their use of various methods of data retrieval; B) Publishers - measuring the level of their satisfaction with the Taktab system and also their expectations of the existence of a recommending...
Machine Translation on the Medical Domain: The Role of BLEU/NIST and METEOR in a Controlled Vocabulary Setting
The main objective of our project is to extract clinical information from thoracic radiology reports in Portuguese using Machine Translation (MT) and cross-language information retrieval techniques. To accomplish this task, we need to evaluate the machine translation system involved. Since human MT evaluation is costly and time-consuming, we opted to use automated methods. We propose an evaluatio...
A Network Model Approach to Retrieval in the Semantic Web
While it is agreed that semantic enrichment of resources would lead to better search results, at present the low coverage of resources on the web with semantic information presents a major hurdle in realizing the vision of search on the Semantic Web. To address this problem, we investigate how to improve retrieval performance in settings where resources are sparsely annotated with semantic info...
Annotation for Information Extraction from Mammography Reports
Inter- and intra-observer variability in mammographic interpretation is a challenging problem, and decision support systems (DSS) may help to reduce variation in practice. Since radiology reports are created as unstructured text, natural language processing (NLP) techniques are needed to extract structured information from the reports in order to provide the inputs to DSS. Before creat...